Morphological Cross Reference method for English to Telugu Transliteration
نویسندگان
چکیده
Machine Transliteration is a sub field of Computational linguistics for automatically converting letters in one language to another language, which deals with Grapheme or Phoneme based transliteration approaches. Several methods for Machine Transliteration have been proposed till date based on nature of languages considered, but those methods are having less precision for English to Telugu transliteration when both pronunciation and spelling of the word is considered. Morphological cross reference approach provides user friendly environment for transliteration of English to Telugu text, where both the pronunciation and the spelling of the word is taken into consideration to improve the precision of transliteration system. In addition to alphabet by alphabet transliteration, this paper also deals with whole document transliteration. Our system achieved an correct transliteration with an accuracy of '78%' of Transliteration for Vocabulary words.
منابع مشابه
Transliteration Based Text Input Methods for Telugu
Telugu is the third most spoken language in India and one of the fifteen most spoken languages in the world. But, there is no standardized input method for Telugu, which has a widespread use. Since majority of users of Telugu typing tools on the computers are familiar with English, we propose a transliteration based text input method in which the users type Telugu using Roman script. We have sh...
متن کاملOn the Utility of a Syllable-like Segmentation for Learning a Transliteration from English to an Indic Language
Source and target word segmentation and alignment is a primary step in the statistical learning of a Transliteration. Here, we analyze the benefit of a syllable-like segmentation approach for learning a transliteration from English to an Indic language, which aligns the training set word pairs in terms of sub-syllable-like units instead of individual character units. While this has been found u...
متن کاملKannada and Telugu Native Languages to English Cross Language Information Retrieval
One of the crucial challenges in cross lingual information retrieval is the retrieval of relevant information for a query expressed in as native language. While retrieval of relevant documents is slightly easier, analysing the relevance of the retrieved documents and the presentation of the results to the users are non-trivial tasks. To accomplish the above task, we present our Kannada English ...
متن کاملImproving Recall for Hindi, Telugu, Oromo to English CLIR
This paper presents the Cross Language Information Retrieval (CLIR) experiments of the Language Technologies Research Centre (LTRC, IIIT-Hyderabad) as part of our participation in the ad-hoc track of CLEF 2007. We present approaches to improve recall of query translation by handling morphological and spelling variations in source language keywords. We also present experiments using query expans...
متن کاملCross-Language IR at University of Tsukuba: Automatic Transliteration for Japanese, English, and Korean
This paper describes our cross-language information retrieval system for the NTCIR-4 CLIR task. Our system, which follows the query translation approach, uses a compound word translation and transliteration. Transliteration is effective if a query includes foreign words, such as technical terms and proper nouns, spelled out by phonetic alphabets. We apply our method, which was originally propos...
متن کامل